Value Back-Propagation versus Backtracking in Real-Time Heuristic Search

نویسندگان

  • Sverrir Sigmundarson
  • Yngvi Björnsson
چکیده

One of the main drawbacks of the LRTA* real-time heuristic search algorithm is slow convergence. Backtracking as introduced by SLA* is one way of speeding up the convergence, although at the cost of sacrificing first-trial performance. The backtracking mechanism of SLA* consists of back-propagating updated heuristic values to previously visited states while the algorithm retracts its steps. In this paper we separate these hitherto intertwined aspects, and investigate the benefits of each independently. We present backpropagating search variants that do value back-propagation without retracting their steps. Our empirical evaluation shows that in some domains the value back-propagation is the key to improved efficiency while in others the retracting role is the main contributor. Furthermore, we evaluate learning performance of selected search variants during intermediate trial runs and quantify the importance of loop elimination for such a comparison. For example, our results indicate that the firsttrial performance of LRTA* in pathfinding domains is much better than previously perceived in the literature.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On Backtracking in Real-time Heuristic Search

Real-time heuristic search algorithms are suitable for situated agents that need to make their decisions in constant time. Since the original work by Korf nearly two decades ago, numerous extensions have been suggested. One of the most intriguing extensions is the idea of backtracking wherein the agent decides to return to a previously visited state as opposed to moving forward greedily. This i...

متن کامل

Bilateral Teleoperation Systems Using Backtracking Search optimization Algorithm Based Iterative Learning Control

This paper deals with the application of Iterative Learning Control (ILC) to further improve the performance of teleoperation systems based on Smith predictor. The goal is to achieve robust stability and optimal transparency for these systems. The proposed control structure make the slave manipulator follow the master in spite of uncertainties in time delay in communication channel and model pa...

متن کامل

Counting-Based Look-Ahead Schemes for Constraint Satisfaction

The paper presents a new look-ahead scheme for backtracking search for solving constraint satisfaction problems. This look-ahead scheme computes a heuristic for value ordering and domain pruning. The heuristic is based on approximating the number of solutions extending each partial solution. In particular, we investigate a recent partitionbased approximation of tree-clustering algorithms, Itera...

متن کامل

Using Expectation Maximization to Find Likely Assignments for Solving Constraint Satisfaction Problems

We present a new probabilistic framework for finding likely variable assignments in difficult constraint satisfaction problems. Finding such assignments is key to efficient search, but practical efforts have largely been limited to random guessing and heuristically designed weighting systems. In contrast, we derive a new version of Belief Propagation (BP) using the method of Expectation Maximiz...

متن کامل

New Look-Ahead Schemes for Constraint Satisfaction

This paper presents new look-ahead schemes for backtracking search when solving constraint satisfaction problems. The look-ahead schemes compute a heuristic for value ordering and domain pruning, which influences variable orderings at each node in the search space. As a basis for a heuristic, we investigate two tasks, both harder than the CSP task. The first is finding the solution with min-num...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006